Korean TimeML and Korean TimeBank

نویسندگان

  • Young-Seob Jeong
  • Won-Tae Joo
  • Hyun-Woo Do
  • Chae-Gyun Lim
  • Key-Sun Choi
  • Ho-Jin Choi
چکیده

Many emerging documents usually contain temporal information. Because the temporal information is useful for various applications, it became important to develop a system of extracting the temporal information from the documents. Before developing the system, it first necessary to define or design the structure of temporal information. In other words, it is necessary to design a language which defines how to annotate the temporal information. There have been some studies about the annotation languages, but most of them was applicable to only a specific target language (e.g., English). Thus, it is necessary to design an individual annotation language for each language. In this paper, we propose a revised version of Koreain Time Mark-up Language (K-TimeML), and also introduce a dataset, named Korean TimeBank, that is constructed basd on the K-TimeML. We believe that the new K-TimeML and Korean TimeBank will be used in many further researches about extraction of temporal information.

برای دانلود رایگان متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

KTimeML: Specification of Temporal and Event Expressions in Korean Text

TimeML, TimeBank, and TTK (TARSQI Project) have been playing an important role in enhancement of IE, QA, and other NLP applications. TimeML is a specification language for events and temporal expressions in text. This paper presents the problems and solutions for porting TimeML to Korean as a part of the Korean TARSQI Project. We also introduce the KTTK which is an automatic markup tool of temp...

متن کامل

TRIOS-TimeBank Corpus: Extended TimeBank Corpus with Help of Deep Understanding of Text

TimeBank (Pustejovsky et al, 2003a), a reference for TimeML (Pustejovsky et al, 2003b) compliant annotation, is widely used temporally annotated corpus in the community. It captures time expressions, events, and relations between events and event and temporal expression; but there is room for improvements in this hand-annotated widely used TimeBank corpus. This work is one such effort to extend...

متن کامل

TimeBank-Driven TimeML Analysis

The design of TimeML as an expressive language for temporal information brings promises, and challenges; in particular, its representational properties raise the bar for traditional information extraction methods applied to the task of text-to-TimeML analysis. A reference corpus, such as TimeBank, is an invaluable asset in this situation; however, certain characteristics of TimeBank—size and co...

متن کامل

Analysis of TimeBank as a Resource for TimeML Parsing

We present an analysis of the TimeBank corpus—the only available reference for TimeML-compliant annotation—from the point of view of its utility as a training resource for developing automated TimeML annotators. Experimental results indicative of the potential of TimeBank are encouraging; at the same time, closer inspection of causes for some systematic errors shows certain deficiencies in the ...

متن کامل

A Hidden Contributor to the Korean Miracle: The Korean Credit :union: Movement

Korean credit :::union:::s (CUs) are considered to be a hidden contributor to the “Korean miracle”, characterized by remarkable economic growth and relatively low income inequality. The Korean miracle not only generated wealth in an economically strapped and socially under-privileged people, but also contributed to regional community development and the democratization of Korean society. In...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

عنوان ژورنال:

دوره   شماره 

صفحات  -

تاریخ انتشار 2016